Fill target errors with nans for evaluate #41919

nagkumar91 · 2025-07-07T15:32:37Z

Description

Please add an informative description that covers that changes made by the pull request and link all relevant issues.

If an SDK is being regenerated based on a new swagger spec, a link to the pull request containing these swagger spec changes has been included above.

All SDK Contribution checklist:

The pull request does not introduce [breaking changes]
CHANGELOG is updated for new features, bug fixes or other significant changes.
I have read the contribution guidelines.

General Guidelines and Best Practices

Title of the pull request is clear and informative.
There are a small number of commits, each of which have an informative message. This means that previously merged commits do not appear in the history of the PR. For more information on cleaning up the commits in your PR, see this page.

Testing Guidelines

Pull request includes test coverage for the included changes.

Add pyrit and not remove the other one

Copilot

Pull Request Overview

This PR enhances the target application step by filling any rows that fail execution with NaN values and logging a warning when such failures occur.

Adds a warning log when some target executions fail.
Reindexes the output to align with all input rows, filling missing ones with NaN.
Adjusts the concatenation order to ensure equal-length DataFrames.

Comments suppressed due to low confidence (2)

sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/_evaluate/_evaluate.py:620

The function's docstring should be updated to mention that rows failing target execution are reindexed and filled with NaN, and that a warning is logged when failures occur.

    if failed_lines > 0:

sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/_evaluate/_evaluate.py:638

No unit test currently covers the scenario where some rows fail and are filled with NaN. Consider adding a test that simulates partial failures, asserts NaN in the missing rows, and verifies the warning is logged.

    target_output = target_output.reindex(complete_index)

Nagkumar Arkalgud and others added 30 commits May 28, 2025 11:11

Prepare evals SDK Release

4318329

Fix bug

192b980

Fix for ADV_CONV for FDP projects

758adb4

Update release date

de09fd1

Merge branch 'main' into main

ef60fe6

Merge branch 'Azure:main' into main

8ca51d0

Merge branch 'Azure:main' into main

98bfc3a

Merge branch 'Azure:main' into main

a5f32e8

Merge branch 'Azure:main' into main

5fd88b6

Merge branch 'Azure:main' into main

51f2b44

Merge branch 'Azure:main' into main

a5be8b5

Merge branch 'Azure:main' into main

75965b7

Merge branch 'Azure:main' into main

d0c5e53

Merge branch 'Azure:main' into main

b790276

Merge branch 'Azure:main' into main

d5ca243

re-add pyrit to matrix

8d62e36

Change grader ids

59a70f2

Merge branch 'Azure:main' into main

4d146d7

Update unit test

f7a4c83

replace all old grader IDs in tests

79e3a40

Merge branch 'main' into main

588cbec

Update platform-matrix.json

7514472

Add pyrit and not remove the other one

Update test to ensure everything is mocked

28b2513

tox/black fixes

8603e0e

Skip that test with issues

895f226

Merge branch 'Azure:main' into main

b4b2daf

update grader ID according to API View feedback

023f07f

Update test

45b5f5d

remove string check for grader ID

1ccb4db

Merge branch 'Azure:main' into main

6fd9aa5

Nagkumar Arkalgud and others added 7 commits July 2, 2025 11:45

Update changelog and officialy start freeze

f871855

update the enum according to suggestions

59ac230

update the changelog

794a2c4

Finalize logic

b33363c

Merge branch 'Azure:main' into main

464e2dd

Fill the dataset when target doesn't respond with all columns

98dc816

Tox fixes

3943344

Copilot AI review requested due to automatic review settings July 7, 2025 15:32

nagkumar91 requested a review from a team as a code owner July 7, 2025 15:32

github-actions bot added the Evaluation Issues related to the client library for Azure AI Evaluation label Jul 7, 2025

Copilot AI reviewed Jul 7, 2025

View reviewed changes

Nagkumar Arkalgud added 7 commits July 7, 2025 13:51

Send dataframe instead of previous run

7504164

tox fixes

9f3d5bc

Add a test

610f97f

more fox fixes

330f653

Fix failing e2e test

91c0be9

Update regex to solve the column mapping

856d72a

Re add a validation step

2281478

nagkumar91 enabled auto-merge (squash) July 10, 2025 18:43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fill target errors with nans for evaluate #41919

Fill target errors with nans for evaluate #41919

nagkumar91 commented Jul 7, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Fill target errors with nans for evaluate #41919

Are you sure you want to change the base?

Fill target errors with nans for evaluate #41919

Conversation

nagkumar91 commented Jul 7, 2025

Description

All SDK Contribution checklist:

General Guidelines and Best Practices

Testing Guidelines

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Uh oh!

Uh oh!